Overview

Dataset statistics

Number of variables13
Number of observations323
Missing cells0
Missing cells (%)0.0%
Duplicate rows4
Duplicate rows (%)1.2%
Total size in memory32.9 KiB
Average record size in memory104.4 B

Variable types

NUM11
CAT2

Warnings

Dataset has 4 (1.2%) duplicate rows Duplicates

Reproduction

Analysis started2020-12-15 16:22:55.855267
Analysis finished2020-12-15 16:23:13.300593
Duration17.45 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

Time
Real number (ℝ≥0)

Distinct164
Distinct (%)50.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean109.5356037
Minimum0
Maximum236
Zeros2
Zeros (%)0.6%
Memory size2.5 KiB

Quantile statistics

Minimum0
5-th percentile12.1
Q151.5
median103
Q3163
95-th percentile220.9
Maximum236
Range236
Interquartile range (IQR)111.5

Descriptive statistics

Standard deviation66.29667793
Coefficient of variation (CV)0.6052523169
Kurtosis-1.11736834
Mean109.5356037
Median Absolute Deviation (MAD)54
Skewness0.2164662277
Sum35380
Variance4395.249505
MonotocityIncreasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
14551.5%
 
14051.5%
 
19051.5%
 
20951.5%
 
4441.2%
 
7841.2%
 
2641.2%
 
9541.2%
 
10341.2%
 
4141.2%
 
Other values (154)27986.4%
 
ValueCountFrequency (%) 
020.6%
 
120.6%
 
220.6%
 
410.3%
 
720.6%
 
ValueCountFrequency (%) 
23610.3%
 
23510.3%
 
23410.3%
 
23120.6%
 
23010.3%
 

V1
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.2455680059
Minimum-6.093247805
Maximum1.492935977
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-6.093247805
5-th percentile-2.781467086
Q1-0.927523034
median-0.370562536
Q31.104234444
95-th percentile1.293359904
Maximum1.492935977
Range7.586183782
Interquartile range (IQR)2.031757478

Descriptive statistics

Standard deviation1.432845175
Coefficient of variation (CV)-5.834820256
Kurtosis2.248500516
Mean-0.2455680059
Median Absolute Deviation (MAD)1.124105305
Skewness-1.230463585
Sum-79.31846589
Variance2.053045295
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1.03837033441.2%
 
-0.53538776320.6%
 
-2.41948562620.6%
 
-2.42041282420.6%
 
-0.52991228420.6%
 
1.2490547210.3%
 
-0.74287714610.3%
 
0.80568180410.3%
 
-4.2575974410.3%
 
1.21140602410.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-6.09324780510.3%
 
-5.85246510810.3%
 
-5.40125766310.3%
 
-5.28597051310.3%
 
-5.10187714410.3%
 
ValueCountFrequency (%) 
1.49293597710.3%
 
1.47877285310.3%
 
1.44904378110.3%
 
1.43105340910.3%
 
1.38639697410.3%
 

V2
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1946914427
Minimum-12.11421274
Maximum5.26737615
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-12.11421274
5-th percentile-1.335096003
Q1-0.17468001
median0.266150712
Q30.873891581
95-th percentile1.757362168
Maximum5.26737615
Range17.38158889
Interquartile range (IQR)1.048571591

Descriptive statistics

Standard deviation1.333692698
Coefficient of variation (CV)6.850289257
Kurtosis26.05485201
Mean0.1946914427
Median Absolute Deviation (MAD)0.519753177
Skewness-3.081771684
Sum62.88533599
Variance1.778736214
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.12748612741.2%
 
0.86526780820.6%
 
0.87389158120.6%
 
1.94788538520.6%
 
1.94934570420.6%
 
1.89702180410.3%
 
2.14272151610.3%
 
1.0434593310.3%
 
0.87808388210.3%
 
1.0225669910.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-12.1142127410.3%
 
-5.93171744110.3%
 
-5.45014783410.3%
 
-5.08121516210.3%
 
-4.42918377510.3%
 
ValueCountFrequency (%) 
5.2673761510.3%
 
4.84732319710.3%
 
4.39023019910.3%
 
2.60013820210.3%
 
2.35298430610.3%
 

V3
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8358074658
Minimum-5.694973183
Maximum3.772856852
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-5.694973183
5-th percentile-0.8662009686
Q10.261188651
median0.839421415
Q31.493069935
95-th percentile2.49215913
Maximum3.772856852
Range9.467830035
Interquartile range (IQR)1.231881285

Descriptive statistics

Standard deviation1.061107448
Coefficient of variation (CV)1.269559667
Kurtosis4.785610737
Mean0.8358074658
Median Absolute Deviation (MAD)0.626269539
Skewness-0.9079029733
Sum269.9658115
Variance1.125949017
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.18445588841.2%
 
0.55299766120.6%
 
1.34724732920.6%
 
1.35107628820.6%
 
0.55364604320.6%
 
-0.01895310410.3%
 
1.26727718510.3%
 
0.30986729110.3%
 
2.49571533710.3%
 
0.16648011310.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-5.69497318310.3%
 
-3.45803355510.3%
 
-2.36048306110.3%
 
-1.76189493510.3%
 
-1.59124167210.3%
 
ValueCountFrequency (%) 
3.77285685210.3%
 
3.5627703410.3%
 
3.40258500410.3%
 
3.35071744810.3%
 
2.97477937510.3%
 

V4
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3618770982
Minimum-4.515824355
Maximum4.075817303
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-4.515824355
5-th percentile-1.747107623
Q1-0.4322320825
median0.487834384
Q31.179209079
95-th percentile2.595465841
Maximum4.075817303
Range8.591641658
Interquartile range (IQR)1.611441161

Descriptive statistics

Standard deviation1.290396378
Coefficient of variation (CV)3.565841508
Kurtosis0.6792490699
Mean0.3618770982
Median Absolute Deviation (MAD)0.761860364
Skewness-0.137606802
Sum116.8863027
Variance1.665122811
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1.10994979141.2%
 
0.14545667720.6%
 
0.14757547420.6%
 
0.98306888620.6%
 
0.98271009620.6%
 
1.47528945910.3%
 
1.29334959710.3%
 
0.6533349410.3%
 
0.54447398910.3%
 
-0.53542107310.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-4.51582435510.3%
 
-3.15190785710.3%
 
-2.80748189110.3%
 
-2.73727134610.3%
 
-2.68279898810.3%
 
ValueCountFrequency (%) 
4.07581730310.3%
 
3.96056779610.3%
 
3.71006135910.3%
 
3.48148583410.3%
 
3.31138504610.3%
 

V5
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.03317523672
Minimum-6.631950832
Maximum7.672543966
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-6.631950832
5-th percentile-1.479057783
Q1-0.5670254535
median-0.051269792
Q30.4899611265
95-th percentile2.532000185
Maximum7.672543966
Range14.3044948
Interquartile range (IQR)1.05698658

Descriptive statistics

Standard deviation1.254325101
Coefficient of variation (CV)37.80907765
Kurtosis7.559613146
Mean0.03317523672
Median Absolute Deviation (MAD)0.533694205
Skewness0.3448660653
Sum10.71560146
Variance1.57333146
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.441698941.2%
 
-0.28151806620.6%
 
0.41420885820.6%
 
0.43368021220.6%
 
-0.28481527520.6%
 
0.42246515310.3%
 
-0.09215762810.3%
 
-1.36016210910.3%
 
0.4264102910.3%
 
0.15474775710.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-6.63195083210.3%
 
-5.51775792210.3%
 
-3.80378772110.3%
 
-3.32308406410.3%
 
-2.5855230410.3%
 
ValueCountFrequency (%) 
7.67254396610.3%
 
3.28197152610.3%
 
3.22881956110.3%
 
3.04910587810.3%
 
3.00222377410.3%
 

V6
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3714670934
Minimum-2.145672699
Maximum5.122102581
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-2.145672699
5-th percentile-0.9959931713
Q1-0.499897258
median0.062291102
Q30.619268285
95-th percentile3.640598257
Maximum5.122102581
Range7.26777528
Interquartile range (IQR)1.119165543

Descriptive statistics

Standard deviation1.346730027
Coefficient of variation (CV)3.625435607
Kurtosis1.727730536
Mean0.3714670934
Median Absolute Deviation (MAD)0.568001468
Skewness1.46018184
Sum119.9838712
Variance1.813681766
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.94528252741.2%
 
0.10022309420.6%
 
0.08698293820.6%
 
2.40895755220.6%
 
2.41119959220.6%
 
1.80049938110.3%
 
0.03237591710.3%
 
-0.39750630710.3%
 
-0.70327452410.3%
 
-0.03225480710.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-2.14567269910.3%
 
-1.76340557410.3%
 
-1.64525343110.3%
 
-1.43872833210.3%
 
-1.43750217610.3%
 
ValueCountFrequency (%) 
5.12210258110.3%
 
5.05181166310.3%
 
4.77600042510.3%
 
4.09950648210.3%
 
4.09191538310.3%
 

V26
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.01133634702
Minimum-1.243924154
Maximum3.065575697
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-1.243924154
5-th percentile-0.5741906337
Q1-0.316004533
median-0.111062585
Q30.20007148
95-th percentile0.8873561269
Maximum3.065575697
Range4.309499851
Interquartile range (IQR)0.516076013

Descriptive statistics

Standard deviation0.4781864425
Coefficient of variation (CV)-42.18170471
Kurtosis5.019502926
Mean-0.01133634702
Median Absolute Deviation (MAD)0.244313055
Skewness1.345642475
Sum-3.661640088
Variance0.2286622738
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
-0.16693683641.2%
 
-0.55290360320.6%
 
-0.31360687520.6%
 
-0.55347096920.6%
 
-0.3137029520.6%
 
0.19996365810.3%
 
-0.28109720610.3%
 
-0.09124154110.3%
 
-0.94367790610.3%
 
-0.55147459610.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-1.24392415410.3%
 
-0.98199291410.3%
 
-0.96061051310.3%
 
-0.94367790610.3%
 
-0.83520330710.3%
 
ValueCountFrequency (%) 
3.06557569710.3%
 
1.37403121110.3%
 
1.28620050610.3%
 
1.19654919910.3%
 
1.19492806410.3%
 

V27
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.02111417404
Minimum-2.377932922
Maximum2.490503055
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-2.377932922
5-th percentile-0.4685036046
Q1-0.0565647345
median0.019988218
Q30.100665432
95-th percentile0.3905326997
Maximum2.490503055
Range4.868435977
Interquartile range (IQR)0.1572301665

Descriptive statistics

Standard deviation0.3621914025
Coefficient of variation (CV)17.15394605
Kurtosis20.14409464
Mean0.02111417404
Median Absolute Deviation (MAD)0.078186256
Skewness1.042531538
Sum6.819878215
Variance0.131182612
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.08124671841.2%
 
-0.1882808820.6%
 
-0.18743124820.6%
 
-0.07830550220.6%
 
-0.07328808420.6%
 
0.24621930510.3%
 
-0.00630700410.3%
 
-0.35457919210.3%
 
0.26507824810.3%
 
-0.00460789810.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-2.37793292210.3%
 
-1.20692108110.3%
 
-1.12345654510.3%
 
-0.99616100410.3%
 
-0.89885765110.3%
 
ValueCountFrequency (%) 
2.49050305510.3%
 
2.46886710110.3%
 
1.72070651810.3%
 
0.95039272710.3%
 
0.85737319210.3%
 

V28
Real number (ℝ)

Distinct316
Distinct (%)97.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-0.02199846598
Minimum-1.64855313
Maximum1.575379799
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum-1.64855313
5-th percentile-0.4792822277
Q1-0.036570382
median0.021077628
Q30.0841168035
95-th percentile0.234478022
Maximum1.575379799
Range3.223932929
Interquartile range (IQR)0.1206871855

Descriptive statistics

Standard deviation0.2910797393
Coefficient of variation (CV)-13.23181987
Kurtosis11.27446771
Mean-0.02199846598
Median Absolute Deviation (MAD)0.062814962
Skewness-0.6319961238
Sum-7.10550451
Variance0.08472741463
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
0.00119157641.2%
 
0.02542737820.6%
 
0.11947193920.6%
 
0.02330704520.6%
 
0.11983098820.6%
 
0.09957893410.3%
 
0.00509362210.3%
 
0.00804669610.3%
 
0.09804815910.3%
 
0.02727121910.3%
 
Other values (306)30694.7%
 
ValueCountFrequency (%) 
-1.6485531310.3%
 
-1.32516429910.3%
 
-1.25554915610.3%
 
-1.10575115710.3%
 
-1.08533918810.3%
 
ValueCountFrequency (%) 
1.57537979910.3%
 
1.57308358410.3%
 
0.94959424610.3%
 
0.67289985310.3%
 
0.63226084110.3%
 

Amount
Real number (ℝ≥0)

Distinct251
Distinct (%)77.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean81.43851393
Minimum0.75
Maximum3828.04
Zeros0
Zeros (%)0.0%
Memory size2.5 KiB

Quantile statistics

Minimum0.75
5-th percentile1
Q16.345
median20.53
Q365.13
95-th percentile267.632
Maximum3828.04
Range3827.29
Interquartile range (IQR)58.785

Descriptive statistics

Standard deviation262.3371372
Coefficient of variation (CV)3.221290818
Kurtosis132.7962335
Mean81.43851393
Median Absolute Deviation (MAD)17.84
Skewness10.15751013
Sum26304.64
Variance68820.77355
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
182.5%
 
1.9872.2%
 
12.9961.9%
 
9.9961.9%
 
0.8961.9%
 
1051.5%
 
1.2941.2%
 
0.9941.2%
 
1.1841.2%
 
2.6930.9%
 
Other values (241)27083.6%
 
ValueCountFrequency (%) 
0.7510.3%
 
0.7620.6%
 
0.7810.3%
 
0.8961.9%
 
0.9941.2%
 
ValueCountFrequency (%) 
3828.0410.3%
 
1402.9510.3%
 
1142.0210.3%
 
937.6920.6%
 
919.610.3%
 

Class
Categorical

Distinct4
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
0
303 
2
 
9
1
 
6
3
 
5
ValueCountFrequency (%) 
030393.8%
 
292.8%
 
161.9%
 
351.5%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

M or F
Categorical

Distinct2
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Memory size2.5 KiB
M
174 
F
149 
ValueCountFrequency (%) 
M17453.9%
 
F14946.1%
 
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
Histogram of lengths of the category

Length

Max length1
Median length1
Mean length1
Min length1

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

TimeV1V2V3V4V5V6V26V27V28AmountClassM or F
00-1.359807-0.0727812.5363471.378155-0.3383210.462388-0.1891150.133558-0.021053149.621M
101.1918570.2661510.1664800.4481540.060018-0.0823610.125895-0.0089830.0147242.691M
21-1.358354-1.3401631.7732090.379780-0.5031981.800499-0.139097-0.055353-0.059752378.661M
31-0.966272-0.1852261.792993-0.863291-0.0103091.247203-0.2219290.0627230.061458123.501M
42-1.1582330.8777371.5487180.403034-0.4071930.0959210.5022920.2194220.21515369.991F
52-0.4259660.9605231.141109-0.1682520.420987-0.0297280.1059150.2538440.0810803.671F
641.2296580.1410040.0453711.2026130.1918810.272708-0.2572370.0345070.0051684.992F
77-0.6442691.4179641.074380-0.4921990.9489340.428118-0.051634-1.206921-1.08533940.802F
87-0.8942860.286157-0.113192-0.2715262.6695993.721818-0.3841570.0117470.14240493.202F
99-0.3382621.1195931.044367-0.2221870.499361-0.2467610.0941990.2462190.0830763.682M

Last rows

TimeV1V2V3V4V5V6V26V27V28AmountClassM or F
313225-0.6088310.8768372.4957153.1386740.161264-0.1070990.0724010.099756-0.13652435.110M
314227-1.4653811.3821721.0197900.2503671.011414-1.281807-0.491529-0.689819-0.3293741.000M
3152271.120599-0.3133080.3953090.596756-0.4204120.3258620.579529-0.026439-0.00708229.950M
3162281.1057680.3330120.1841481.276948-0.078118-0.810949-0.6710690.0059130.02825245.300F
317230-0.4202700.8147601.5132421.2927820.1380530.223342-0.2551790.244131-0.01338479.180F
318231-0.9615070.7433511.173582-0.2225930.7405281.7399590.4112910.4973150.20749926.980F
3192310.2831000.8192841.0543090.348488-0.156817-0.5091690.1791930.0517910.1141211.980F
320234-0.6024830.4790890.549750-1.069814-0.5016870.214778-0.117581-0.0737100.05520442.810F
321235-0.663511-0.0444431.029253-2.498072-1.350085-0.798774-0.4239360.2519890.13395725.000M
322236-1.1690871.0962920.6696881.0525340.4224650.703450-0.1981040.0911070.0758840.990M

Duplicate rows

Most frequent

TimeV1V2V3V4V5V6V26V27V28AmountClassM or Fcount
1741.0383700.1274860.1844561.1099500.4416990.945283-0.1669370.0812470.0011921.180M3
026-0.5299120.8738921.3472470.1454570.4142090.100223-0.552904-0.0732880.0233076.140F2
2145-2.4194861.9493460.5529980.982710-0.2848152.411200-0.313607-0.1874310.1194726.740M2